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(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: VOLKER MITTENDORF 

(B) STREET: Institut de Biologie et Physiologie Vegetales 

(C) CITY: Batiment de Biologie 

(D) STATE: Lausanne 

(E) COUNTRY: Switzerland 

(F) POSTAL CODE (ZIP) : CH-1015 

(G) TELEPHONE: (41) (21) 692-4222 

(H) TELEFAX: (41) (21) 692-4195 

(A) NAME: YVES POIRIER 

(B) STREET: Institut de Biologie et Physiologie Vegetales 

(C) CITY: Batiment de Biologie 

(D) STATE: Lausanne 

(E) COUNTRY: Switzerland 

(F) POSTAL CODE (ZIP) : CH-1015 

(G) TELEPHONE: (41) (21) 692-4222 

(H) TELEFAX: (41) (21) 692-4195 

(ii) TITLE OF INVENTION: BIOSYNTHESIS OF MEDIUM CHAIN LENGTH 
POLYHYDROXYALKANOATES 

(iii) NUMBER OF SEQUENCES: 26 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1677 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
ATGAGTCAGA AGAACAATAA CGAGCTTCCC AAGCAAGCCG CGGAAAACAC GCTGAACCTG 
AATCCGGTGA TCGGCATCCG GGGCAAGGAC CTGCTCACCT CCGCGCGCAT GGTCCTGCTC 
CAGGCGGTGC GCCAGCCGCT GCACAGCGCC AGGCACGTGG CGCATTTCAG CCTGGAGCTG 
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(C) STIWCEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ser Gin Lys Asn Asn Asn Glu Leu Pro Lys Gin Ala Ala Glu Asn 
15 10 15 

Thr Leu Asn Leu Asn Pro Val lie Gly lie Arg Gly Lys Asp Leu Leu 
20 25 30 

Thr Ser Ala Arg Met Val Leu Leu Gin Ala Val Arg Gin Pro Leu His 
35 40 45 

Ser Ala Arg His Val Ala His Phe Ser Leu Glu Leu Lys Asn Val Leu 
50 55 60 

Leu Gly Gin Ser Glu Leu Arg Pro Gly Asp Asp Asp Arg Arg Phe Ser 
65 70 75 80 

Asp Pro Ala Trp Ser Gin Asn Pro Leu Tyr Lys Arg Tyr Met Gin Thr 
85 90 95 

Tyr Leu Ala Trp Arg Lys Glu Leu His Ser Trp lie Ser His Ser Asp 
100 105 110 

Leu Ser Pro Gin Asp lie Ser Arg Gly Gin Phe Val lie Asn Leu Leu 
115 120 125 

Thr Glu Ala Met Ser Pro Thr Asn Ser Leu Ser Asn Pro Ala Ala Val 
130 135 140 

Lys Arg Phe Phe Glu Thr Gly Gly Lys Ser Leu Leu Asp Gly Leu Gly 
145 150 155 160 

His Leu Ala Lys Asp Leu Val Asn Asn Gly Gly Met Pro Ser Gin Val 
165 170 175 

Asp Met Asp Ala Phe Glu Val Gly Lys Asn Leu Ala Thr Thr Glu Gly 
180 185 190 

Ala Val Val Phe Arg Asn Asp Val Leu Glu Leu lie Gin Tyr Arg Pro 
195 200 205 

He Thr Glu Ser Val His Glu Arg Pro Leu Leu Val Val Pro Pro Gin 
210 215 220 

He Asn Lys Phe Tyr Val Phe Asp Leu Ser Pro Asp Lys Ser Leu Ala 
225 230 235 240 

Arg Phe Cys Leu Arg Asn Gly Val Gin Thr Phe He Val Ser Trp Arg 
245 250 255 

Asn Pro Thr Lys Ser Gin Arg Glu Trp Gly Leu Thr Thr Tyr He Glu 
260 265 270 



Ala Leu Lys Glu Ala He Glu Val Val Leu Ser He Thr Gly Ser Lys 
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2751^ 280 ^^285 



Asp Leu Asn Leu Leu Gly Ala Cys Ser Gly Gly lie Thr Thr Ala Thr 
290 295 300 

Leu Val Gly His Tyr Val Ala Ser Gly Glu Lys Lys Val Asn Ala Phe 
305 310 315 320 

Thr Gin Leu Val Ser Val Leu Asp Phe Glu Leu Asn Thr Gin Val Ala 
325 330 335 

Leu Phe Ala Asp Glu Lys Thr Leu Glu Ala Ala Lys Arg Arg Ser Tyr 
340 345 350 

Gin Ser Gly Val Leu Glu Gly Lys Asp Met Ala Lys Val Phe Ala Trp 
355 360 365 

Met Arg Pro Asn Asp Leu lie Trp Asn Tyr Trp Val Asn Asn Tyr Leu 
370 375 380 

Leu Gly Asn Gin Pro Pro Ala Phe Asp lie Leu Tyr Trp Asn Asn Asp 
385 390 395 400 

Thr Thr Arg Leu Pro Ala Ala Leu His Gly Glu Phe Val Glu Leu Phe 
405 410 415 

Lys Ser Asn Pro Leu Asn Arg Pro Gly Ala Leu Glu Val Ser Gly Thr 
420 425 430 

Pro lie Asp Leu Lys Gin Val Thr Cys Asp Phe Tyr Cys Val Ala Gly 
435 440 445 

Leu Asn Asp His lie Thr Pro Trp Glu Ser Cys Tyr Lys Ser Ala Arg 
450 455 460 

Leu Leu Gly Gly Lys Cys Glu Phe lie Leu Ser Asn Ser Gly His lie 
465 470 475 480 

Gin Ser lie Leu Asn Pro Pro Gly Asn Pro Lys Ala Arg Phe Met Thr 
485 490 495 

Asn Pro Glu Leu Pro Ala Glu Pro Lys Ala Trp Leu Glu Gin Ala Gly 
500 505 510 

Lys His Ala Asp Ser Trp Trp Leu His Trp Gin Gin Trp Leu Ala Glu 
515 520 525 

Arg Ser Gly Lys Thr Arg Lys Ala Pro Ala Ser Leu Gly Asn Lys Thr 
530 535 540 

Tyr Pro Ala Gly Glu Ala Ala Pro Gly Thr Tyr Val His Glu Arg 
545 550 555 

INFORMATION FOR SEQ ID NO : 3 : 



(i) SEQUENCE CHARACTERISTICS: 
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15 



20 



35 



40 



45 



50 



55 



(A) L^^^H: 1680 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATGCGAGAAA AGCAGGAATC GGGTAGCGTG CCGGTGCCCG CCGAGTTCAT GAGTGCACAG 60 

AGCGCCATCG TCGGCCTGCG CGGCAAGGAC CTGCTGACGA CGGTCCGCAG CCTGGCTGTC 120 

CACGGCCTGC GCCAGCCGCT GCACAGTGCG CGGCACCTGG TCGCCTTCGG AGGCCAGTTG 180 

GGCAAGGTGC TGCTGGGCGA CACCCTGCAC CAGCCGAACC CACAGGACGC CCGCTTCCAG 240 

GATCCATCCT GGCGCCTCAA TCCCTTCTAC CGGCGCACCC TGCAGGCCTA CCTGGCGTGG 3 00 

CAGAAACAAC TGCTCGCCTG GATCGACGAA AGCAACCTGG ACTGCGACGA TCGCGCCCGC 360 

p GCCCGCTTCC TCGTCGCCTT GCTCTCCGAC GCCGTGGCAC CCAGCAACAG CCTGATCAAT 420 

€l 25 

ill CCACTGGCGT TAAAGGAACT GTTCAATACC GGCGGGATCA GCCTGCTCAA TGGCGTCCGC 480 

m 

m CACCTGCTCG AAGACCTGGT GCACAACGGC GGCATGCCCA GCCAGGTGAA CAAGACCGCC 540 

yj TTCGAGATCG GTCGCAACCT CGCCACCACG CAAGGCGCGG TGGTGTTCCG CAACGAGGTG 6 00 

^ CTGGAGCTGA TCCAGTACAA GCCGCTGGGC GAGCGCCAGT ACGCCAAGCC CCTGCTGATC 660 

GTGCCGCCGC KQ^KUCM^Qhl^ GTACTACATC TTCGACCTGT CGCCGGAAAA GAGCTTCGTC 720 

CAGTACGCCC TGAAGAACAA CCTGCAGGTC TTCGTCATCA GTTGGCGCAA CCCCGACGCC 780 

CAGCACCGCG AATGGGGCCT GAGCACCTAT GTCGAGGCCC TCGACCAGGC CATCGAGGTC 840 

AGCCGCGAGA TCACCGGCAG CCGCAGCGTG AACCTGGCCG GCGCCTGCGC CGGCGGGCTC 900 

ACCGTAGCCG CCTTGCTCGG CCACCTGCAG GTGCGCCGGC AACTGCGCAA GGTCAGTAGC 960 

GTCACCTACC TGGTCAGCCT GCTCGACAGC CAGATGGAAA GCCCGGCGAT GCTCTTCGCC 102 0 

GACGAGCAGA CCCTGGAGAG CAGCAAGCGC CGCTCCTACC AGCATGGCGT GCTGGACGGG 108 0 

CGCGACATGG CCAAGGTGTT CGCCTGGATG CGCCCCAACG ACCTGATCTG GAACTACTGG 1140 

GTCAACAACT ACCTGCTCGG CAGGCAGCCG CCGGCGTTCG ACATCCTCTA CTGGAACAAC 1200 

GACAACACGC GGCTGCCCGC GGCGTTCCAC GGCGAACTGC TCGACCTGTT CAAGCACAAC 1260 

CCGCTGACCC GCCCGGGCGC GCTGGAGGTC AGCGGGACCG CGGTGGACCT GGGCAAGGTG 1320 



GCGATCGACA GCTTCCACGT CGCCGGCATC ACCGACCACA TCACGCCCTG GGACGCGGTG 1380 
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TATCGCTCGG 


CCCTCCTGCT 


GGGCGGCCAG 


CGCCGCTTCA 


TCCTGTCCAA 


CAGCGGGCAC 


1440 


ATCCAGAGCA 


TCCTCAACCC 


TCCCGGAAAC 


CCCAAGGCCT 


GCTACTTCGA 


GAACGACAAG 


1500 


CTGAGCAGCG 


ATCCACGCGC 






GCGAAGAGGG 


CAGCTGGTGG 


1560 


CCGGTCTGGC 


TGGGCTGGCT 


GCAGGAGCGC 


TCGGGCGAGC 


TGGGCAACCC 


TGACTTCAAC 


1620 


CTTGGCAGCG 


CCGCGCATCC 


GCCCCTCGAA 


GCGGCCCCGG 


GCACCTACGT 


GCATATACGC 


1680 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 560 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Arg Glu Lys Gin Glu Ser Gly Ser Val Pro Val Pro Ala Glu Phe 
15 10 15 

Met Ser Ala Gin Ser Ala He Val Gly Leu Arg Gly Lys Asp Leu Leu 
20 25 30 

Thr Thr Val Arg Ser Leu Ala Val His Gly Leu Arg Gin Pro Leu His 
35 40 45 

Ser Ala Arg His Leu Val Ala Phe Gly Gly Gin Leu Gly Lys Val Leu 
50 55 60 

Leu Gly Asp Thr Leu His Gin Pro Asn Pro Gin Asp Ala Arg Phe Gin 
65 70 75 80 

Asp Pro Ser Trp Arg Leu Asn Pro Phe Tyr Arg Arg Thr Leu Gin Ala 
85 90 95 

Tyr Leu Ala Trp Gin Lys Gin Leu Leu Ala Trp He Asp Glu Ser Asn 
100 105 110 

Leu Asp Cys Asp Asp Arg Ala Arg Ala Arg Phe Leu Val Ala Leu Leu 
115 120 125 

Ser Asp Ala Val Ala Pro Ser Asn Ser Leu He Asn Pro Leu Ala Leu 
130 135 140 

Lys Glu Leu Phe Asn Thr Gly Gly He Ser Leu Leu Asn Gly Val Arg 
145 150 155 160 
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His Leu Leu 



Asp Leu Val His Asn Gly Gly Ml^R»ro Ser Gin Val 

165 170 175 



Asn Lys Thr Ala Phe Glu lie Gly Arg Asn Leu Ala Thr Thr Gin Gly 
180 185 190 

Ala Val Val Phe Arg Asn Glu Val Leu Glu Leu lie Gin Tyr Lys Pro 
195 200 205 

Leu Gly Glu Arg Gin Tyr Ala Lys Pro Leu Leu lie Val Pro Pro Gin 
210 215 220 

lie Asn Lys Tyr Tyr lie Phe Asp Leu Ser Pro Glu Lys Ser Phe Val 
225 230 235 240 

Gin Tyr Ala Leu Lys Asn Asn Leu Gin Val Phe Val lie Ser Trp Arg 
245 250 255 

Asn Pro Asp Ala Gin His Arg Glu Trp Gly Leu Ser Thr Tyr Val Glu 
260 265 270 

Ala Leu Asp Gin Ala lie Glu Val Ser Arg Glu lie Thr Gly Ser Arg 
275 280 285 

Ser Val Asn Leu Ala Gly Ala Cys Ala Gly Gly Leu Thr Val Ala Ala 
290 295 300 

Leu Leu Gly His Leu Gin Val Arg Arg Gin Leu Arg Lys Val Ser Ser 
305 310 315 320 

Val Thr Tyr Leu Val Ser Leu Leu Asp Ser Gin Met Glu Ser Pro Ala 
325 330 335 

Met Leu Phe Ala Asp Glu Gin Thr Leu Glu Ser Ser Lys Arg Arg Ser 
340 345 350 

Tyr Gin His Gly Val Leu Asp Gly Arg Asp Met Ala Lys Val Phe Ala 
355 360 365 

Trp Met Arg Pro Asn Asp Leu lie Trp Asn Tyr Trp Val Asn Asn Tyr 
370 375 380 

Leu Leu Gly Arg Gin Pro Pro Ala Phe Asp lie Leu Tyr Trp Asn Asn 
385 390 395 400 

Asp Asn Thr Arg Leu Pro Ala Ala Phe His Gly Glu Leu Leu Asp Leu 
405 410 415 

Phe Lys His Asn Pro Leu Thr Arg Pro Gly Ala Leu Glu Val Ser Gly 
420 425 430 

Thr Ala Val Asp Leu Gly Lys Val Ala He Asp Ser Phe His Val Ala 
435 440 445 

Gly He Thr Asp His He Thr Pro Trp Asp Ala Val Tyr Arg Ser Ala 
450 455 460 
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m 



Leu Leu Leu Gly Gly Gin Arg Arg Phe lie Leu Ser Asn Ser Gly His 
465 470 475 480 

lie Gin Ser lie Leu Asn Pro Pro Gly Asn Pro Lys Ala Cys Tyr Phe 
485 490 495 

Glu Asn Asp Lys Leu Ser Ser Asp Pro Arg Ala Trp Tyr Tyr Asp Ala 
500 505 510 

Lys Arg Glu Glu Gly Ser Trp Trp Pro Val Trp Leu Gly Trp Leu Gin 
515 520 525 

Glu Arg Ser Gly Glu Leu Gly Asn Pro Asp Phe Asn Leu Gly Ser Ala 
15 530 535 540 

Ala His Pro Pro Leu Glu Ala Ala Pro Gly Thr Tyr Val His lie Arg 
545 550 555 560 



10 



20 



(2) INFORMATION FOR SEQ ID NO : 5: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 9 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

35 GGAGAATTCC CGATGAGCCA GAAGAACAA 29 

(2) INFORMATION FOR SEQ ID NO : 6: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

50 

CTGGAAGCTT TTGATCGTTC ATGCACGTA 29 

(2) INFORMATION FOR SEQ ID NO: .7: 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 
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(B) TY^Mtiucleic acid 

(C) STR^mEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GTGG7U\TTCA TGCGTGAAAA GCAGGAATC 29 
10 (2) INFORMATION FOR SEQ ID NO : 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8: 
GGCCAAGCTT TTGAGCGTAT ATGCACGTA 2 9 

(2) INFORMATION FOR SEQ ID NO: 9: 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1731 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



35 



□ (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9: 

"'"^ 40 ATGGCTGCAT CTTTCTCTGT CCCCTCTATG ATCATGGAAG AGGAAGGGAG ATTTGAGGCG 60 

GAAGTTGCGG AAGTGCAGAC TTGGTGGAGC TCAGAGAGGT TCAAGCTAAC AAGGCGTCCT 120 

TACACGGCCC GTGACGTGGT GGCTCTACGT GGTCATCTCA AGCAAGGTTA TGCTTCGAAC 180 

45 

GAGATGGCTA AGAAGCTGTG GAGAACGCTC AAGAGTCACC AAGTCAACGG CACGGCGTCT 240 

CGCACGTTTG GTGCCTTGGA CCCTGTTCAG GTGACAATGA TGGCTAAACA TTTAGACACC 300 

50 ATTTATGTCT CTGGTTGGCA GTGCTCGTCT ACTCACACCT CCACTAACGA GCCTGGTCCG 360 

GATCTTGCTG ACTATCCATA CGATACCGTT CCTAACAAGG TCGAACATCT CTTCTTCGCT 420 

CAGCAGTACC ATGACAGAAA ACAGAGGGAG GCGAGAATGA GCATGAGCAG AGAAGAAAGA 480 



55 



GCAAAAACTC CGTTTGTGGA CTACTTGAAG CCCATCATCG CCGACGGAGG AACCGGCTTC 540 
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15 



35 

111 



GGCGGTACCA CTGCCACCGT AAAACTCTGC AAACTCTTCG TTGAAAGAGG AGCCGCTGGG 600 

GTCCACATCG AGGACCAGTC CTCCGTCACC AAGAAGTGTG GCCACATGGC CGGAAAAGTC 660 

5 

CTCGTGGCAG TCAGTGAACA CATCAACCGC CTTGTTGCGG CTCGGCTCCA GTTCGACGTG 720 

ATGGGCACAG AGACCGTCCT GGTCGCTAGA ACGGACGCGG TCGCGCCCAC TCTGATCCAA 780 

10 TCGTU^CATTG ACTCAAGGGA CCACCAGTTC ATCCTCGGTG TCACTAACCC AAACCTTAGA 840 

GGCT^GAGTT TGTCCTCGCT TCTGGCCGAG GGAATGGCTG TAGGCAATAA TGGTCCAGCG 900 

TTGCAAGCGA TTGAGGATCA ATGGCTTAGC TCAGCTCGTC TCATGACTTT CTCGGACGCT 960 

GTCGTGGAGG CTCTCAAGCG CATGAACCTA AGTGAGAATG AGAAGAGCCG GAGAGTGACC 102 0 

GAGTGGCTAA TCCATGCAAG GTACGAGAAC TGCCTTTCAA ACGAGCAAGG CCGAGAATTA 108 0 

20 GCAGCAAAAC TCGGTGTGAC TGATCTTTTC TGGGACTGGG ACTTGCCCAG AACCAGAGAA 1140 

GGATTCTACC GGTTCCAAGG CTCGGTCACA GCAGCCGTGG TCCGTGGCTG GGCCTTTGCA 1200 

CAGATAGCTG ATCTCATCTG GATGGAAACC GCAAGCCCTG ACCTCAACGA ATGCACCCAA 126 0 

25 

TTCGCAGAAG GAGTCAAGTC CAAGACACCA GAGGTAATGC TCGCCTACAA CCTCTCCCCA 132 0 

TCCTTCAACT GGGACGCTTC TGGTATGACG GATCAGCAGA TGATGGAGTT CATTCCACGA 138 0 

30 ATCGCCAGGC TCGGTTATTG CTGGCAGTTT ATAACCCTTG CGGGTTTCCA TGCGGATGCT 1440 

CTTGTGGTCG ATACGTTTGC AAAGGATTAC GCGAGGAGAG GGATGCTGGC TTATGTCGAG 1500 

AGGATACAGA GAGAAGAGAG GAGCAATGGG GTTGACACAT TGGCTCATCA GAAATGGTCA 1560 

GGTGCTAATT ACTATGATCG TTATCTTAAG ACCGTCCAAG GTGGAATCTC CTCCACTGCA 1620 

GCCATGGGCA AAGGTGTTAC CGAGGAACAA TTCAAAGAGA CCTGGACGAG GCCGGGAGCT 1680 

40 GCTGGAATGG GCGAAGGGAC TAGCCTTGTG GTGGCCAAGT CCAGAATGTA A 1731 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 576 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

50 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ala Ala Ser Phe Ser Val Pro Ser Met lie Met Glu Glu Glu Gly 
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1 



5 



10 



15 ' 



Arg Phe Glu Ala Glu Val Ala Glu Val Gin Thr Trp Trp Ser Ser Glu 
20 25 30 

Arg Phe Lys Leu Thr Arg Arg Pro Tyr Thr Ala Arg Asp Val Val Ala 
35 40 45 

Leu Arg Gly His Leu Lys Gin Gly Tyr Ala Ser Asn Glu Met Ala Lys 
50 55 60 

Lys Leu Trp Arg Thr Leu Lys Ser His Gin Val Asn Gly Thr Ala Ser 
65 70 75 80 

Arg Thr Phe Gly Ala Leu Asp Pro Val Gin Val Thr Met Met Ala Lys 
85 90 95 

His Leu Asp Thr lie Tyr Val Ser Gly Trp Gin Cys Ser Ser Thr His 
100 105 110 

Thr Ser Thr Asn Glu Pro Gly Pro Asp Leu Ala Asp Tyr Pro Tyr Asp 
115 120 125 

Thr Val Pro Asn Lys Val Glu His Leu Phe Phe Ala Gin Gin Tyr His 
130 135 140 

Asp Arg Lys Gin Arg Glu Ala Arg Met Ser Met Ser Arg Glu Glu Arg 
145 150 155 160 

Ala Lys Thr Pro Phe Val Asp Tyr Leu Lys Pro lie lie Ala Asp Gly 
165 170 175 

Gly Thr Gly Phe Gly Gly Thr Thr Ala Thr Val Lys Leu Cys Lys Leu 
180 185 190 

Phe Val Glu Arg Gly Ala Ala Gly Val His He Glu Asp Gin Ser Ser 
195 200 205 

Val Thr Lys Lys Cys Gly His Met Ala Gly Lys Val Leu Val Ala Val 
210 215 220 

Ser Glu His He Asn Arg Leu Val Ala Ala Arg Leu Gin Phe Asp Val 
225 230 235 240 

Met Gly Thr Glu Thr Val Leu Val Ala Arg Thr Asp Ala Val Ala Pro 
245 250 255 

Thr Leu He Gin Ser Asn He Asp Ser Arg Asp His Gin Phe He Leu 
260 265 270 

Gly Val Thr Asn Pro Asn Leu Arg Gly Lys Ser Leu Ser Ser Leu Leu 
275 280 285 

Ala Glu Gly Met Ala Val Gly Asn Asn Gly Pro Ala Leu Gin Ala He 
290 295 300 
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Glu Asp Glr^^ Leu Ser Ser Ala Arg Leu Met "^^^Phe Ser Asp Ala 

305 310 315 320 

Val Val Glu Ala Leu Lys Arg Met Asn Leu Ser Glu Asn Glu Lys Ser 
325 330 335 

Arg Arg Val Thr Glu Trp Leu He His Ala Arg Tyr Glu Asn Cys Leu 
340 345 350 

Ser Asn Glu Gin Gly Arg Glu Leu Ala Ala Lys Leu Gly Val Thr Asp 
355 360 365 

Leu Phe Trp Asp Trp Asp Leu Pro Arg Thr Arg Glu Gly Phe Tyr Arg 
370 375 380 

Phe Gin Gly Ser Val Thr Ala Ala Val Val Arg Gly Trp Ala Phe Ala 
385 390 395 400 

Gin He Ala Asp Leu He Trp Met Glu Thr Ala Ser Pro Asp Leu Asn 
405 410 415 

Glu Cys Thr Gin Phe Ala Glu Gly Val Lys Ser Lys Thr Pro Glu Val 
420 425 430 

Met Leu Ala Tyr Asn Leu Ser Pro Ser Phe Asn Trp Asp Ala Ser Gly 
435 440 445 

Met Thr Asp Gin Gin Met Met Glu Phe He Pro Arg He Ala Arg Leu 
450 455 460 

Gly Tyr Cys Trp Gin Phe He Thr Leu Ala Gly Phe His Ala Asp Ala 
465 470 475 480 

Leu Val Val Asp Thr Phe Ala Lys Asp Tyr Ala Arg Arg Gly Met Leu 
485 490 495 

Ala Tyr Val Glu Arg He Gin Arg Glu Glu Arg Ser Asn Gly Val Asp 
500 505 510 

Thr Leu Ala His Gin Lys Trp Ser Gly Ala Asn Tyr Tyr Asp Arg Tyr 
515 520 525 

Leu Lys Thr Val Gin Gly Gly He Ser Ser Thr Ala Ala Met Gly Lys 
530 535 540 

Gly Val Thr Glu Glu Gin Phe Lys Glu Thr Trp Thr Arg Pro Gly Ala 
545 550 555 560 

Ala Gly Met Gly Glu Gly Thr Ser Leu Val Val Ala Lys Ser Arg Met 
565 570 575 



INFORMATION FOR SEQ ID NO : 11: . 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 base pairs 
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(B) T^Hf: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
ACTGAAGCTT TGGGCAAAGG TGTTAC 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GTGGTCTAGA AGTTTTTCTG CGAAGATG 
(2) INFORjyiATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GGCAAAGGTG TTACCGAGGA ACAATTCAAA GAGACCTGGA CGAGGCCGGG AGCTGCTGGA 
ATGGGCGAAG GGACTAGCCT TGTGGTGGCC AAGTCCAGAA TG 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Gly Lys Gly Val Thr Glu Glu Gin Phe Lys Glu Thr Trp Thr Arg Pro 
5 15 10 15 

Gly Ala Ala Gly Met Gly Glu Gly Thr Ser Leu Val Val Ala Lys Ser 
20 25 30 

10 Arg Met 



(2) INFORMATION FOR SEQ ID NO: 15: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1677 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

ill 

ifl ATGAGCCAGA AGAACAATAA CGAGCTTCCC AAGCAAGCCG CGGAAAACAC GCTGAACCTG 60 



III 



AATCCGGTGA TCGGCATCCG GGGCAAGGAC CTGCTCACCT CCGCGCGCAT GGTCCTGCTC 12 0 

30 

CAGGCGGTGC GCCAGCCGCT GCACAGCGCC AGGCACGTGG CGCATTTCAG CCTGGAGCTG 180 

AAGAACGTCC TGCTCGGCCA GTCGGAGCTA CGCCCAGGCG ATGACGACCG ACGCTTTTCC 240 

= s 

35 GATCCGGCCT GGAGCCAGAA TCCACTGTAC AAGCGCTACA TGCAGACCTA CCTGGCCTGG 300 

fy 

"'^^ CGCAAGGAGC TGCACAGCTG GATCAGCCAC AGCGACCTGT CGCCGCAGGA CATCAGTCGT 360 

o 

O GGCCAGTTCG TCATCAACCT GCTGACCGAG GCGATGTCGC CGACCAACAG CCTGAGCAAC 420 
40 

CCGGCGGCGG TCAAGCGCTT CTTCGAGACC GGCGGCAAGA GCCTGCTGGA CGGCCTCGGC 480 

CACCTGGCCA AGGACCTGGT GAACAACGGC GGGATGCCGA GCCAGGTGGA CATGGACGCC 540 

45 TTCGAGGTGG GCAAGAACCT GGCCACCACC GAGGGCGCCG TGGTGTTCCG CAACGACGTG 600 

CTGGAACTGA TCCAGTACCG GCCGATCACC GAGTCGGTGC ACGAACGCCC GCTGCTGGTG 660 

GTGCCGCCGC AGATCAACAA GTTCTACGTC TTCGACCTGT CGCCGGACAA GAGCCTGGCG 720 

50 

CGCTTCTGCC TGCGCAACGG CGTGCAGACC TTCATCGTCA GTTGGCGCAA CCCGACCAAG 780 

TCGCAGCGCG AATGGGGCCT GACCACCTAT ATCGAGGCGC TCAAGGAGGC CATCGAGGTA 840 

55 GTCCTGTCGA TCACCGGCAG CAAGGACCTC AACCTCCTCG GCGCCTGCTC CGGCGGGATC 900 
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irf 



10 



15 



20 



35 



40 



rC^^5G CCACTACGTG GCCAGCGGCG AGAAG^^^l 



ACCACCGCGA CCCTG^RgG CCACTACGTG GCCAGCGGCG AGAAGAAGGT CAACGCCTTC 960 

ACCCAACTGG TCAGCGTGCT CGACTTCGAA. CTGAATACCC AGGTCGCGCT GTTCGCCC-AC 1020 

GAGAAGACTC TGGAGGCCGC CAAGCGTCGT TCCTACCAGT CCGGCGTGCT GGAGGGCAAG 1080 

GACATGGCGA AGGTGTTCGC CTGGATGCGC CCCAACGACC TGATCTGGAA CTACTGGGTC 1140 

AACAACTACC TGCTCGGCAA CCAGCCGCCG GCGTTCGACA TCCTCTACTG GAACAACGAC 1200 

ACCACGCGCC TGCCCGCCGC GCTGCACGGC GAGTTCGTCG AACTGTTCAA GAGCAACCCG 1260 

CTGAACCGCC CCGGCGCCCT GGAGGTCTCC GGCACGCCCA TCGACCTGAA GCAGGTGACT 1320 

TGCGACTTCT ACTGTGTCGC CGGTCTGAAC GACCACATCA CCCCCTGGGA GTCGTGCTAC 1380 

AAGTCGGCCA GGCTGCTGGG TGGCAAGTGC GAGTTCATCC TCTCCAACAG CGGTCACATC 1440 

CAGAGCATCC TCAACCCACC GGGCAACCCC AAGGCACGCT TCATGACCAA TCCGGAACTG 1500 

CCCGCCGAGC CCAAGGCCTG GCTGGAACAG GCCGGCAAGC ACGCCGACTC GTGGTGGTTG 1560 

CACTGGCAGC AATGGCTGGC CGAACGCTCC GGCAAGACCC GCAAGGCGCC CGCCAGCCTG 1620 

25 GGCAACAAGA CCTATCCGGC CGGCGAAGCC GCGCCCGGAA CCTACGTGCA TGAACGA 1677 
ffl (2) INFORMATION FOR SEQ ID NO: 16: 



IJl (i) SEQUENCE CHARACTERISTICS: 

y 30 (A) LENGTH: 1680 base pairs 

,p (B) TYPE: nucleic acid 

J (C) STRANDEDNESS: single 

fl (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

ATGCGTGAAA AGCAGGAATC GGGTAGCGTG CCGGTGCCCG CCGAGTTCAT GAGTGCACAG 60 

AGCGCCATCG TCGGCCTGCG CGGCAAGGAC CTGCTGACGA CGGTCCGCAG CCTGGCTGTC 12 0 

45 CACGGCCTGC GCCAGCCGCT GCACAGTGCG CGGCACCTGG TCGCCTTCGG AGGCCAGTTG 180 

GGCAAGGTGC TGCTGGGCGA CACCCTGCAC CAGCCGAACC CACAGGACGC CCGCTTCCAG 24 0 

GATCCATCCT GGCGCCTCAA TCCCTTCTAC CGGCGCACCC TGCAGGCCTA CCTGGCGTGG 300 

50 

CAGAAACAAC TGCTCGCCTG GATCGACGAA AGCAACCTGG ACTGCGACGA TCGCGCCCGC 360 

GCCCGCTTCC TCGTCGCCTT GCTCTCCGAC GCCGTGGCAC CCAGCAACAG CCTGATCAAT 420 

55 CCACTGGCGT TAAAGGAACT GTTCAATACC GGCGGGATCA GCCTGCTCAA TGGCGTCCGC 480 

-73- 



10 



20 



30 



At^^GT GCACAACGGC GGCATGCCCA GCCAC^l^ 



CACCTGCTCG AAGAI^^GT GCACAACGGC GGCATGCCCA GCCAgSIcAA CAAGACCGCC 54 0 

TTCGAGATCG GTCGCAACCT CGCCACCACG CAAGGCGCGG TGGTGTTCCG CAACGAGGTG 600 

CTGGAGCTGA TCCAGTACAA GCCGCTGGGC GAGCGCCAGT ACGCCAAGCC CCTGCTGATC 660 

GTGCCGCCGC AGATCAACAA GTACTACATC TTCGACCTGT CGCCGGAAAA GAGCTTCGTC 720 

CAGTACGCCC TGAAGAACAA CCTGCAGGTC TTCGTCATCA GTTGGCGCAA CCCCGACGCC 780 

CAGCACCGCG AATGGGGCCT GAGCACCTAT GTCGAGGCCC TCGACCAGGC CATCGAGGTC 840 

AGCCGCGAGA TCACCGGCAG CCGCAGCGTG AACCTGGCCG GCGCCTGCGC CGGCGGGCTC 900 

15 ACCGTAGCCG CCTTGCTCGG CCACCTGCAG GTGCGCCGGC AACTGCGCAA GGTCAGTAGC 960 

GTCACCTACC TGGTCAGCCT GCTCGACAGC CAGATGGAAA GCCCGGCGAT GCTGTTCGCC 1020 

GACGAGCAGA CCCTGGAGAG CAGCAAGCGC CGCTCCTACC AGCATGGCGT GCTGGACGGG 1080 

CGCGACATGG CCAAGGTGTT CGCCTGGATG CGCCCCAACG ACCTGATCTG GAACTACTGG 1140 

GTCAACAACT ACCTGCTCGG CAGGCAGCCG CCGGCGTTCG ACATCCTCTA CTGGAACAAC 1200 

Q 

=0 25 GACAACACGC GGCTGCCCGC GGCGTTCCAC GGCGAACTGC TCGACCTGTT CAAGCACAAC 1260 

IJl 

fli CCGCTGACCC GCCCGGGCGC GCTGGAGGTC AGCGGGACCG CGGTGGACCT GGGCAAGGTG 1320 

\n GCGATCGACA GCTTCCACGT CGCCGGCATC ACCGACCACA TCACGCCCTG GGACGCGGTG 1380 

TATCGCTCGG CCCTCCTGCT GGGCGGCCAG CGCCGCTTCA TCCTGTCCAA CAGCGGGCAC 1440 

ATCCAGAGCA TCCTCAACCC TCCCGGAAAC CCCAAGGCCT GCTACTTCGA GAACGACAAG 1500 



ir\ 

35 CTGAGCAGCG ATCCACGCGC CTGGTACTAC GACGCCAAGC GCGAAGAGGG CAGCTGGTGG 1560 

fy 

CCGGTCTGGC TGGGCTGGCT GCAGGAGCGC TCGGGCGAGC TGGGCAACCC TGACTTCAAC 1620 

o 

U CTTGGCAGCG CCGCGCATCC GCCCCTCGAA GCGGCCCCGG GCACCTACGT GCATATACGC 1680 

40 

(2) INFORMATION FOR SEQ ID NO : 17: 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 1791 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

50 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
ATGAGCCAGA AGAACAATAA CGAGCTTCCC AAGCAAGCCG CGGAAAACAC GCTGAACCTG 60 

-74- 



10 



15 



20 



25 

11! 



30 



m 

m 35 



40 



45 



50 



55 



AATCCGGTGA TCGGCATCCG GGGCAAGGAC CTGCTCACCT CCGCGCGCAT GGTCCTGCTC 12 0 

CAGGCGGTGC GCCAGCCGCT GCACAGCGCC AGGCACGTGG CGCATTTCAG CCTGGAGCTG 180 

AAGAACGTCC TGCTCGGCCA GTCGGAGCTA CGCCCAGGCG ATGACGACCG ACGCTTTTCC 240 

GATCCGGCCT GGAGCCAGAA TCCACTGTAC AAGCGCTACA TGCAGACCTA CCTGGCCTGG 3 00 

CGCAAGGAGC TGCACAGCTG GATCAGCCAC AGCGACCTGT CGCCGCAGGA CATCAGTCGT 360 

GGCCAGTTCG TCATCAACCT GCTGACCGAG GCGATGTCGC CGACCAACAG CCTGAGCAAC 420 

CCGGCGGCGG TCAAGCGCTT CTTCGAGACC GGCGGCAAGA GCCTGCTGGA CGGCCTCGGC 480 

CACCTGGCCA AGGACCTGGT GAACAACGGC GGGATGCCGA GCCAGGTGGA CATGGACGCC 540 

TTCGAGGTGG GCAAGAACCT GGCCACCACC GAGGGCGCCG TGGTGTTCCG CAACGACGTG 600 

CTGGAACTGA TCCAGTACCG GCCGATCACC GAGTCGGTGC ACGAACGCCC GCTGCTGGTG 660 

GTGCCGCCGC AGATCAACAA GTTCTACGTC TTCGACCTGT CGCCGGACAA GAGCCTGGCG 720 

CGCTTCTGCC TGCGCAACGG CGTGCAGACC TTCATCGTCA GTTGGCGCAA CCCGACCAAG 780 

TCGCAGCGCG AATGGGGCCT GACCACCTAT ATCGAGGCGC TCAAGGAGGC CATCGAGGTA 840 

GTCCTGTCGA TCACCGGCAG CAAGGACCTC AACCTCCTCG GCGCCTGCTC CGGCGGGATC 900 

ACCACCGCGA CCCTGGTCGG CCACTACGTG GCCAGCGGCG AGAAGAAGGT CAACGCCTTC 960 

ACCCAACTGG TCAGCGTGCT CGACTTCGAA CTGAATACCC AGGTCGCGCT GTTCGCCGAC 1020 

GAGAAGACTC TGGAGGCCGC CAAGCGTCGT TCCTACCAGT CCGGCGTGCT GGAGGGCAAG 1080 

GACATGGCCA AGGTGTTCGC CTGGATGCGC CCCAACGACC TGATCTGGAA CTACTGGGTC 1140 

AACAACTACC TGCTCGGCAA CCAGCCGCCG GCGTTCGACA TCCTCTACTG GAACAACGAC 1200 

ACCACGCGCC TGCCCGCCGC GCTGCACGGC GAGTTCGTCG AACTGTTCAA GAGCAACCCG 1260 

CTGAACCGCC CCGGCGCCCT GGAGGTCTCC GGCACGCCCA TCGACCTGAA GCAGGTGACT 1320 

TGCGACTTCT ACTGTGTCGC CGGTCTGAAC GACCACATCA CCCCCTGGGA GTCGTGCTAC 1380 

AAGTCGGCCA GGCTGCTGGG TGGCAAGTGC GAGTTCATCC TCTCCAACAG CGGTCACATC 144 0 

CAGAGCATCC TCAACCCACC GGGCAACCCC AAGGCACGCT TCATGACCAA TCCGGAACTG 1500 

CCCGCCGAGC CCAAGGCCTG GCTGGAACAG GCCGGCAAGC ACGCCGACTC GTGGTGGTTG 1560 

CACTGGCAGC AATGGCTGGC CGAACGCTCC GGCAAGACCC GCAAGGCGCC CGCCAGCCTG 1620 

GGCAACAAGA CCTATCCGGC CGGCGAAGCC GCGCCCGGAA CCTACGTGCA TGAACGATCA 1680 

AAAGCTTTGG GCAAAGGTGT TACCGAGGAA CAATTCAAAG AGACCTGGAC GAGGCCGGGA 1740 

-75- 



GCTGCTGGAA TGGGCGAAGG GACTAGCCTT GTGGTGGCCA AGTCCAGAAT G 1791 
(2) INFORMATION FOR SEQ ID NO: 18: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 597 amino acids 

(B) TYPE: amino acid 

(C ) STRANDEDNESS : 

10 (D) TOPOLOGY: linear 



15 



20 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Ser Gin Lys Asn Asn Asn Glu Leu Pro Lys Gin Ala Ala Glu Asn 
1 5 10 15 

Thr Leu Asn Leu Asn Pro Val lie Gly lie Arg Gly Lys Asp Leu Leu 
20 25 30 



M Thr Ser Ala Arg Met Val Leu Leu Gin Ala Val Arg Gin Pro Leu His 

^fl 25 35 40 45 

yi 

Ser Ala Arg His Val Ala His Phe Ser Leu Glu Leu Lys Asn Val Leu 

fu 50 55 60 

Spj H 

HjJ 30 Leu Gly Gin Ser Glu Leu Arg Pro Gly Asp Asp Asp Arg Arg Phe Ser 

65 70 75 80 

Asp Pro Ala Trp Ser Gin Asn Pro Leu Tyr Lys Arg Tyr Met Gin Thr 
85 90 95 



Tyr Leu Ala Trp Arg Lys Glu Leu His Ser Trp lie Ser His Ser Asp 
100 105 110 



Leu Ser Pro Gin Asp lie Ser Arg Gly Gin Phe Val lie Asn Leu Leu 
40 115 120 125 

Thr Glu Ala Met Ser Pro Thr Asn Ser Leu Ser Asn Pro Ala Ala Val 
130 135 140 

45 Lys Arg Phe Phe Glu Thr Gly Gly Lys Ser Leu Leu Asp Gly Leu Gly 

145 150 155 160 

His Leu Ala Lys Asp Leu Val Asn Asn Gly Gly Met Pro Ser Gin Val 
165 170 175 

50 

Asp Met Asp Ala Phe Glu Val Gly Lys Asn Leu Ala Thr Thr Glu Gly 
180 185 190 

Ala Val Val Phe Arg Asn Asp Val Leu Glu Leu lie Gin Tyr Arg Pro 
55 195 200 205 

-76- 



lie Thr Glu Val His Glu Arg Pro Leu Leu Vl^^al Pro Pro Gin 

210 215 220 

lie Asn Lys Phe Tyr Val Phe Asp Leu Ser Pro Asp Lys Ser Leu Ala 
225 230 235 240 

Arg Phe Cys Leu Arg Asn Gly Val Gin Thr Phe lie Val Ser Trp Arg 
245 250 255 

Asn Pro Thr Lys Ser Gin Arg Glu Trp Gly Leu Thr Thr Tyr lie Glu 
260 265 270 

Ala Leu Lys Glu Ala lie Glu Val Val Leu Ser lie Thr Gly Ser Lys 
275 280 285 

Asp Leu Asn Leu Leu Gly Ala Cys Ser Gly Gly lie Thr Thr Ala Thr 
290 295 300 

Leu Val Gly His Tyr Val Ala Ser Gly Glu Lys Lys Val Asn Ala Phe 
305 310 315 320 

Thr Gin Leu Val Ser Val Leu Asp Phe Glu Leu Asn Thr Gin Val Ala 
325 330 335 

Leu Phe Ala Asp Glu Lys Thr Leu Glu Ala Ala Lys Arg Arg Ser Tyr 
340 345 350 

Gin Ser Gly Val Leu Glu Gly Lys Asp Met Ala Lys Val Phe Ala Trp 
355 360 365 

Met Arg Pro Asn Asp Leu lie Trp Asn Tyr Trp Val Asn Asn Tyr Leu 
370 375 380 

Leu Gly Asn Gin Pro Pro Ala Phe Asp lie Leu Tyr Trp Asn Asn Asp 
385 390 395 400 

Thr Thr Arg Leu Pro Ala Ala Leu His Gly Glu Phe Val Glu Leu Phe 
.405 410 415 

Lys Ser Asn Pro Leu Asn Arg Pro Gly Ala Leu Glu Val Ser Gly Thr 
420 425 430 

Pro lie Asp Leu Lys Gin Val Thr Cys Asp Phe Tyr Cys Val Ala Gly 
435 440 445 

Leu Asn Asp His lie Thr Pro Trp Glu Ser Cys Tyr Lys Ser Ala Arg 
450 455 460 

Leu Leu Gly Gly Lys Cys Glu Phe lie Leu Ser Asn Ser Gly His lie 
465 470 475 480 

Gin Ser lie Leu Asn Pro Pro Gly Asn Pro Lys Ala Arg Phe Met Thr 
485 490 495 

Asn Pro Glu Leu Pro Ala Glu Pro Lys Ala Trp Leu Glu Gin Ala Gly 
500 505 510 

-77- 



(n 

ill 



10 



Lys His Ala Asp Ser Trp Trp Leu His Trp Gin Gin Trp Leu Ala Glu 
515 520 525 

Arg Ser Gly Lys Thr Arg Lys Ala Pro Ala Ser Leu Gly Asn Lys Thr 
530 535 540 

Tyr Pro Ala Gly Glu Ala Ala Pro Gly Thr Tyr Val His Glu Arg Ser 
545 550 555 560 

Lys Ala Leu Gly Lys Gly Val Thr Glu Glu Gin Phe Lys Glu Thr Trp 
565 570 575 

Thr Arg Pro Gly Ala Ala Gly Met Gly Glu Gly Thr Ser Leu Val Val 
15 580 585 590 

Ala Lys Ser Arg Met 
595 

20 (2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1794 base pairs 

(B) TYPE: nucleic acid 

25 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

ATGCGTGAAA AGCAGGAATC GGGTAGCGTG CCGGTGCCCG CCGAGTTCAT GAGTGCACAG 60 

i]1 35 

lU AGCGCCATCG TCGGCCTGCG CGGCAAGGAC CTGCTGACGA CGGTCCGCAG CCTGGCTGTC 120 

CACGGCCTGC GCCAGCCGCT GCACAGTGCG CGGCACCTGG TCGCCTTCGG AGGCCAGTTG 180 

40 GGCAAGGTGC TGCTGGGCGA CACCCTGCAC CAGCCGAACC CACAGGACGC CCGCTTCCAG 240 

GATCCATCCT GGCGCCTCAA TCCCTTCTAC CGGCGCACCC TGCAGGCCTA CCTGGCGTGG 300 

CAGAAACAAC TGCTCGCCTG GATCGACGAA AGCAACCTGG ACTGCGACGA TCGCGCCCGC 360 

45 

GCCCGCTTCC TCGTCGCCTT GCTCTCCGAC GCCGTGGCAC CCAGCAACAG CCTGATCAAT 42 0 

CCACTGGCGT TAAAGGAACT GTTCAATACC GGCGGGATCA GCCTGCTCAA TGGCGTCCGC 480 

50 CACCTGCTCG AAGACCTGGT GCACAACGGC GGCATGCCCA GCCAGGTGAA CAAGACCGCC 540 

TTCGAGATCG GTCGCAACCT CGCCACCACG CAAGGCGCGG TGGTGTTCCG CAACGAGGTG 600 

CTGGAGCTGA TCCAGTACAA GCCGCTGGGC GAGCGCCAGT ACGCCAAGCC CCTGCTGATC 660 



55 



GTGCCGCCGC AGATCAACAA GTACTACATC TTCGACCTGT CGCCGGAAAA GAGCTTCGTC 72 0 

-78- 







CAGTACGCCC 


TGAAGAACAA 


CCTGCAGGTC 


TTCGTCATCA 


GTTGGCGC;^ 


CCCCGACGCC 


f o u 




5 


CAGCACCGCG 


AATGGGGCCT 


GAGCACCTAT 


GTCGAGGCCC 


TCGACCAGGC 


CATCGAGGTC 


ft 4 n 

O 1 \J 






AGCCGCGAGA 


TCACCGGCAG 


CCGCAGCGTG 


AACCTGGCCG 


GCGCCTGCGC 


CGGCGGGCTC 








ACCGTAGCCG 


CCTTGCTCGG 


CCACCTGCAG 


GTGCGCCGGC 


AACTGCGCAA 


GGTCAGTAGC 


J D W 




10 


GTCACCTACC 


TGGTCAGCCT 


GCTCGACAGC 


CAGATGGAAA 


GCCCGGCGAT 


GCTGTTCGrP 


J- U ^ \J 






GACGAGCAGA 


CCCTGGAGAG 


CAGCAAGCGC 


CGCTCCTACC 


AGCATGGCGT 


GCTGGAPGGG 

V3^_ X OVJrtV— VJVJVJ 


J. u o u 




15 


CGCGACATGG 


CCAAGGTGTT 


CGCCTGGATG 


CGCCCCAACG 


ACCTGATCTG 


GAAGTAPTGG 






GTCAACAACT 


ACCTGCTCGG 


CAGGCAGCCG 


CCGGCGTTCG 


ACATCCTCTA 


CTGGAACAAP 


1 9 n n 






GACAACACGC 


GGCTGCCCGC 


GGCGTTCCAC 


GGCGAACTGC 


TCGACCTGTT 


CAAGPAPAAP 


J.Z o u 




20 


CCGCTGACCC 


GCCCGGGCGC 


GCTGGAGGTC 


AGCGGGACCG 


CGGTGGACCT 


GGGCAAGGTG 

VJww^fV\wV7 X \J 


X J z u 






GCGATCGACA 


GCTTCCACGT 


CGCCGGCATC 


ACCGACCACA 


TCACGCCCTG 


GGAGGPGOTG 


1 'I p o 
X J o u 




25 


TATCGCTCGG 


CCCTCCTGCT 


GGGCGGCCAG 


CGCCGCTTCA 


TCCTGTCCAA 


\- VJ ^ VJ VJ VJ V- 


T A A n 


: 5 J 
11 j 


ATCCAGAGCA 


TCCTCAACCC 


TCCCGGAAAC 


CCCAAGGCCT 


GCTACTTCGA 


GAAPGAPAAG 


X o u u 




CTGAGCAGCG 


ATCCACGCGC 


CTGGTACTAC 


GACGCCAAGC 


GCGAAGAGGC; 


rAfiPTGGTGG 

V— X Ow i. WV7 


X 3 o u 


; !"r. 
\-$ s 
£ i 


30 


CCGGTCTGGC 


TGGGCTGGCT 


GCAGGAGCGC 


TCGGGCGAGC 


TGGGCAACCC 


TGArTTPAAP 


X o ^ u 






CTTGGCAGCG 


CCGCGCATCC 


GCCCCTCGAA 


GCGGCCCCGG 


GCACCTACGT 


GPATATAPGP 


icon 
X D o u 


fjl 


35 


TCAAAAGCTT 


TGGGCAAAGG 


TGTTACCGAG 


GAACAATTCA 


AAGAGACCTG 


GACGAGGCCG 


1740 


£ : : 


GGAGCTGCTG 


GAATGGGCGA 


AGGGACTAGC 


CTTGTGGTGG 


CCAAGTCCAG 


AATG 


1794 


O 




(2) INFORMATION FOR SEQ ID NO: 20 











40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 598 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Met Arg Glu Lys Gin Glu Ser Gly Ser Val Pro Val Pro Ala Glu Phe 
15 10 15 

^5 Met Ser Ala Gin Ser Ala He Val Gly Leu Arg Gly Lys Asp Leu Leu 

20 25 30 

-79- 



Thr Thr Val Arg Ser Leu Ala Val His Gly Leu Arg Gin Pro Leu His 
35 40 45 



Ser Ala Arg His Leu Val Ala Phe Gly Gly Gin Leu Gly Lys Val Leu 
50 55 60 

Leu Gly Asp Thr Leu His Gin Pro Asn Pro Gin Asp Ala Arg Phe Gin 
65 70 75 80 

Asp Pro Ser Trp Arg Leu Asn Pro Phe Tyr Arg Arg Thr Leu Gin Ala 
85 90 95 

Tyr Leu Ala Trp Gin Lys Gin Leu Leu Ala Trp lie Asp Glu Ser Asn 
100 105 110 

Leu Asp Cys Asp Asp Arg Ala Arg Ala Arg Phe Leu Val Ala Leu Leu 
115 120 125 

Ser Asp Ala Val Ala Pro Ser Asn Ser Leu lie Asn Pro Leu Ala Leu 
130 135 140 

Lys Glu Leu Phe Asn Thr Gly Gly He Ser Leu Leu Asn Gly Val Arg 
145 150 155 160 

His Leu Leu Glu Asp Leu Val His Asn Gly Gly Met Pro Ser Gin Val 
165 170 175 

Asn Lys Thr Ala Phe Glu He Gly Arg Asn Leu Ala Thr Thr Gin Gly 
180 185 190 

Ala Val Val Phe Arg Asn Glu Val Leu Glu Leu He Gin Tyr Lys Pro 
195 200 205 

Leu Gly Glu Arg Gin Tyr Ala Lys Pro Leu Leu lie Val Pro Pro Gin 
210 215 220 

He Asn Lys Tyr Tyr He Phe Asp Leu Ser Pro Glu Lys Ser Phe Val 
225 230 235 240 

Gin Tyr Ala Leu Lys Asn Asn Leu Gin Val Phe Val He Ser Trp Arg 
245 250 255 

Asn Pro Asp Ala Gin His Arg Glu Trp Gly Leu Ser Thr Tyr Val Glu 
260 265 270 

Ala Leu Asp Gin Ala He Glu Val Ser Arg Glu He Thr Gly Ser Arg 
275 280 285 

Ser Val Asn Leu Ala Gly Ala Cys Ala Gly Gly Leu Thr Val Ala Ala 
290 295 300 

Leu Leu Gly His Leu Gin Val Arg Arg Gin Leu Arg Lys Val Ser Ser 
305 310 315 320 



Val Thr Tyr Leu Val Ser Leu Leu Asp Ser Gin Met Glu Ser Pro Ala 

-80- 



325 330 335 

Met Leu Phe Ala Asp Glu Gin Thr Leu Glu Ser Ser Lys Arg Arg Ser 
340 345 350 

Tyr. Gin His Gly Val Leu Asp Gly Arg Asp Met Ala Lys Val Phe Ala 
355 360 365 

Trp Met Arg Pro Asn Asp Leu lie Trp Asn Tyr Trp Val Asn Asn Tyr 
370 375 380 

Leu Leu Gly Arg Gin Pro Pro Ala Phe Asp lie Leu Tyr Trp Asn Asn 
385 390 395 400 

Asp Asn Thr Arg Leu Pro Ala Ala Phe His Gly Glu Leu Leu Asp Leu 
405 410 415 

Phe Lys His Asn Pro Leu Thr Arg Pro Gly Ala Leu Glu Val Ser Gly 
420 425 430 

Thr Ala Val Asp Leu Gly Lys Val Ala lie Asp Ser Phe His Val Ala 
435 440 445 

Gly He Thr Asp His He Thr Pro Trp Asp Ala Val Tyr Arg Ser Ala 
450 455 460 

Leu Leu Leu Gly Gly Gin Arg Arg Phe He Leu Ser Asn Ser Gly His 
465 470 475 480 

He Gin Ser He Leu Asn Pro Pro Gly Asn Pro Lys Ala Cys Tyr Phe 
485 490 495 

Glu Asn Asp Lys Leu Ser Ser Asp Pro Arg Ala Trp Tyr Tyr Asp Ala 
500 505 510 

Lys Arg Glu Glu Gly Ser Trp Trp Pro Val Trp Leu Gly Trp Leu Gin 
515 520 525 

Glu Arg Ser Gly Glu Leu Gly Asn Pro Asp Phe Asn Leu Gly Ser Ala 
530 535 540 

Ala His Pro Pro Leu Glu Ala Ala Pro Gly Thr Tyr Val His He Arg 
545 550 555 560 

Ser Lys Ala Leu Gly Lys Gly Val Thr Glu Glu Gin Phe Lys Glu Thr 
565 570 575 

Trp Thr Arg Pro Gly Ala Ala Gly Met Gly Glu Gly Thr Ser Leu Val 
580 585 590 

Val Ala Lys Ser Arg Met 
595 

INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS: 

-81- 



Ijj 



5= 



III 



55 



(A) L^^TH: 2 737 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: -linear 



10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

GAATTCATGT CTCCAGTTGA TTTTAAAGAT AAAGTTGTGA TCATTACCGG TGCCGGTGGT 60 

GGTTTGGGTA AATACTACTC CCTCGAATTT GCCAAGTTGG GCGCCAAAGT CGTCGTTAAC 120 

15 

GACTTGGGTG GTGCCTTGAA CGGTCAAGGT GGAAACTCCA AGGCCGCCGA CGTTGTCGTT 180 

GACGAAATTG TCAAGAACGG TGGTGTTGCC GTTGCCGATT ACAAC7VACGT CTTGGACGGT 240 

20 GACAAGATTG TCGAAACCGC CGTCAAGAAC TTTGGTACTG TCCACGTTAT CATCAACAAT 3 00 

GCCGGTATCT TGAGAGATGC CTCCATGAAG AAGATGACTG AAAAAGACTA CAAATTGGTC 360 

^2 ATTGACGTGC ACTTGAACGG TGCCTTTGCC GTCACCAAGG CTGCTTGGCC ATACTTCCAA 420 

25 

AAGCAAAAAT ACGGTAGAAT TGTCAACACA TCCTCCCCAG CTGGTTTGTA CGGTAACTTT 480 

GGTCAAGCCA ACTACGCCTC CGCCAAGTCT GCTTTGTTGG GATTCGCTGA AACCTTGGCC 540 

30 AAGGAAGGTG CCAAATACAA CATCAAGGCC AACGCCATTG CTCCGTTGGC CAGATCAAGA 600 

l« ATGACTGAAT CTATCTTGCC ACCTCCAATG TTGGAAAAAT TGGGCCCTGA AAAGGTTGCC 660 

CCATTGGTCT TGTATTTGTC GTCAGCTGAA AACGAATTGA CTGGTCAATT CTTTGAAGTT 72 0 

GCTGCTGGCT TTTACGCTCA GATCAGATGG GAAAGATCCG GTGGTGTCTT GTTCAAGCCA 780 

GATCAATCCT TCACCGCTGA GGTTGTTGCT AAGAGATTCT CTGAAATCCT TGATTATGAC 840 

40 GACTCTAGGA AGCCAGAATA CTTGAAGAAC CAATACCCAT TCATGTTGAA CGACTACGCC 900 

ACTTTGACCA ACGAAGCTAG AAAGTTGCCA GCTAACGATG CTTCTGGTGC TCCAACTGTC 960 

TCCTTGAAGG ACAAGGTTGT TTTGATCACC GGTGCCGGTG CTGGTTTGGG TAAAGAATAC 1020 

45 

GCCAAGTGGT TCGCCAAGTA CGGTGCCAAG GTTGTTGTTA ACGACTTCAA GGATGCTACC 1080 

AAGACCGTTG ACGAAATCAA AGCCGCTGGT GGTGAAGCTT GGCCAGATCA ACACGATGTT 1140 

50 GCCAAGGACT CCGAAGCTAT CATCAAGAAT GTCATTGACA AGTACGGTAC CATTGATATC 1200 

TTGGTCAACA ACGCCGGTAT CTTGAGAGAC AGATCCTTTG CCAAGATGTC CAAGCAAGAA 1260 

TGGGACTCTG TCCAACAAGT CCACTTGATT GGTACTTTCA ACTTGAGCAG ATTGGCATGG 1320 



CCATACTTTG TTGAAAAACA ATTTGGTAGA ATCATCAACA TTACCTCCAC CAGTGGTATC 1380 

-82- 









i X oVj X L-AAvaC 


/■'A A /-I m "A /^m^/^ 

C AAC T AC T CG 


TCTTCTAAGG 


CTGGTATCTT 


GGGTTTGTCC 


1440 




5 




pr^a'PT'oa a^o 


rT^/~i f~^rr\ TV TV ^ A A rri 

TGCTAAGAAT 


AACATTAAGG 


TCAACATTGT 


TGCTCCACAC 


1500 








CCAHaALC rr 


GACCATCTTC 


AGAGAACAAG 


ACAAGAACTT 


GTACCACGCT 


1560 






f5 A n r* a zi nTTrrs 


C L- A i X V7 A T 


GGTCTACTTG 


GGTACTGACG 


ATGTCCCAGT 


CACCGGTGAA 


1620 




10 


APTTPPnaaa 


i CvjVj X (jCj X vjVj 


TTGGATCGGT 


AACACCAGAT 


GGCAAAGAGC 


CAAGGGTGCT 


1680 








a a a ^ TV TV 
AL-tjAACACAC 


CACTGTTGAA 


TTCATCAAGG 


AGCACTTGAA 


CGAAATCACT 


1740 




15 


n 2i PTTT* a rT" a 


U 1 VaACACTGA 


AAATCCAAAA 


TCTACCACCG 


AATCCTCCAT 


GGCTATCTTG 


1800 




i V- i v3V_v^^j i Ha 


vj i CjCj 1 GATGA 


CGATGATGAT 


GACGAAGACG 


AAGAAGAAGA 


CGAAGGTGAT 


1860 








TV /^/^ TV TV Tl H /-i T\ 

At- Cj AAG AAG A 


CGAAGAAGAA 


GACGATCCAG 


TCTGGAGATT 


CGACGACAGA 


1920 




20 




tgtacaacat 


TGCCCTTGGT 


GCCACCACCA 


AGCAATTGAA 


GTACGTCTAC 


1980 








C I CjACTTCCA 


AGTCATTCCA 


ACCTTTGGTC 


ACTTGATCAC 


CTTCAACTCT 


2040 




25 




A TV TV TV ^m/*^/ tfliiii 

AAAACTCCTT 


TGCCAAGTTG 


TTGCGTAACT 


TCAACCCAAT 


GTTGTTGTTG 


2100 


Wi 




ACTACTTGAA 


GGTGCACAGC 


TGGCCACCAC 


CAACCGAAGG 


TGAAATCAAG 


2160 


=1 i 
; Tsr 






AACCAATTGC 


CACTACTCCA 


AAGGGTACCA 


ACGTTGTTAT 


TGTTCACGGT 


2220 


Hie 


30 




1 1 •* A A A ^ TV TV 

TTGACAACAA 


GTCTGGTGAA 


TTGATTTACT 


CCAACGAAGC 


CACTTACTTC 


2280 






afo a a a a r^T" 


(jTCAAGCCGA 


CAACAAGGTC 


TACGCTGACC 


GTCCAGCATT 


CGCCACCAAC 


2340 


01 


35 


p a a ffr* TTr* r* 


OA/^^A A TV TV /~1 

V- AC C AAAG AG 


AGCCCCAGAC 


TACCAAGTTG 


ACGTTCCAGT 


CAGTGAAGAC 


2400 


5 w 
^. 3 


1 1 VTOV- 1 1 1 


xtjiAv-CGTTT 


GTCTGGTGAC 


AGAAACCCAT 


TGCACATTGA TCiCAAACTTT 


2460 


5 




ncvh. a arr*'!^/*' 


CCAAGTTCCC 


TAAGCCAATC 


TTACACGGTA 


TGTGCACTTA 


TGGTTTGAGT 


2520 




40 


n PT" a a rT^THT* 

VaC X AAvjo^- 1 i 


HaAl IGACAA 


GTTTGGTATG 


TTCAACGAAA 


TCAAGGCCAG 


ATTCACCGGT 


2580 






Ai HjJ.(»i ICC 


CAGGTGAAAC 


CTTGAGAGTC 


TTGGCATGGA 


AGGAAAGCGA 


TGACACTATT 


2640 




45 


GTCTTCCAAA 


CTCATGTTGT 


TGATAGAGGT 


ACTATTGCCA 


TTAACAACGC 


TGCTATTAAG 


2700 




TTAGTCGGTG 


ACAAAGCAAA 


GATCTAATGA 


AGGATCC 






2737 






(2) INFORMATION FOR SEQ ID NO: 22 











50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 906 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



-83- 



in 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

5 

Met Ser Pro Val Asp Phe Lys Asp Lys Val Val He He Thr Gly Ala 
15 10 15 

Gly Gly Gly Leu Gly Lys Tyr Tyr Ser Leu Glu Phe Ala Lys Leu Gly 
10 20 25 30 

Ala Lys Val Val Val Asn Asp Leu Gly Gly Ala Leu Asn Gly Gin Gly 
35 40 45 

.15 Gly Asn Ser Lys Ala Ala Asp Val Val Val Asp Glu He Val Lys Asn 

50 55 60 

Gly Gly Val Ala Val Ala Asp Tyr Asn Asn Val Leu Asp Gly Asp Lys 
65 70 75 80 

20 

He Val Glu Thr Ala Val Lys Asn Phe Gly Thr Val His Val He He 
85 90 95 

Asn Asn Ala Gly He Leu Arg Asp Ala Ser Met Lys Lys Met Thr Glu 
25 1 00 1 05 110 



s :5a 

¥1 30 



Lys Asp Tyr Lys Leu Val He Asp Val His Leu Asn Gly Ala Phe Ala 
115 120 125 

Val Thr Lys Ala Ala Trp Pro Tyr Phe Gin Lys Gin Lys Tyr Gly Arg 
130 135 140 

He Val Asn Thr Ser Ser Pro Ala Gly Leu Tyr Gly Asn Phe Gly Gin 
145 150 155 160 

0§ 35 

nJ Ala Asn Tyr Ala Ser Ala Lys Ser Ala Leu Leu Gly Phe Ala Glu Thr 

''M 165 170 175 

□ Leu Ala Lys Glu Gly Ala Lys Tyr Asn He Lys Ala Asn Ala He Ala 

40 180 185 190 

Pro Leu Ala Arg Ser Arg Met Thr Glu Ser He Leu Pro Pro Pro Met 
195 200 205 

45 Leu Glu Lys Leu Gly Pro Glu Lys Val Ala Pro Leu Val Leu Tyr Leu 

210 215 220 

Ser Ser Ala Glu Asn Glu Leu Thr Gly Gin Phe Phe Glu Val Ala Ala 
225 230 235 240 

50 

Gly Phe Tyr Ala Gin He Arg Trp Glu Arg Ser Gly Gly Val Leu Phe 
245 250 255 

Lys Pro Asp Gin Ser Phe Thr Ala Glu Val Val Ala Lys Arg Phe Ser 
55 260 265 270 



-84- 



Glu lie Lel^psp Tyr Asp Asp Ser Arg Lys Pro^Pa Tyr Leu Lys Asn 

^275 280 285 

Gin Tyr Pro Phe Met Leu Asn Asp Tyr Ala Thr Leu Thr Asn Glu Ala 
290 295 300 

Arg Lys Leu Pro Ala Asn Asp Ala Ser Gly Ala Pro Thr Val Ser Leu 
305 310 315 320 

Lys Asp Lys Val Val Leu lie Thr Gly Ala Gly Ala Gly Leu Gly Lys 
325 330 335 

Glu Tyr Ala Lys Trp Phe Ala Lys Tyr Gly Ala Lys Val Val Val Asn 
340 345 350 

Asp Phe Lys Asp Ala Thr Lys Thr Val Asp Glu lie Lys Ala Ala Gly 
355 360 365 

Gly Glu Ala Trp Pro Asp Gin His Asp Val Ala Lys Asp Ser Glu Ala 
370 375 380 

lie lie Lys Asn Val lie Asp Lys Tyr Gly Thr lie Asp lie Leu Val 
385 390 395 400 

Asn Asn Ala Gly lie Leu Arg Asp Arg Ser Phe Ala Lys Met Ser Lys 
405 410 415 

Gin Glu Trp Asp Ser Val Gin Gin Val His Leu He Gly Thr Phe Asn 
420 425 430 

Leu Ser Arg Leu Ala Trp Pro Tyr Phe Val Glu Lys Gin Phe Gly Arg 
435 440 445 

He He Asn He Thr Ser Thr Ser Gly He Tyr Gly Asn Phe Gly Gin 
450 455 460 

Ala Asn Tyr Ser Ser Ser Lys Ala Gly He Leu Gly Leu Ser Lys Thr 
465 470 475 480 

Met Ala He Glu Gly Ala Lys Asn Asn He Lys Val Asn He Val Ala 
485 490 495 

Pro His Ala Glu Thr Ala Met Thr Leu Thr He Phe Arg Glu Gin Asp 
500 505 510 

Lys Asn Leu Tyr His Ala Asp Gin Val Ala Pro Leu Leu Val Tyr Leu 
515 520 525 

Gly Thr Asp Asp Val Pro Val Thr Gly Glu Thr Ser Glu He Gly, Gly 
530 535 540 

Gly Trp He Gly Asn Thr Arg Trp Gin Arg Ala Lys Gly Ala Val Ser 
545 550 555 560 

His Asp Glu His Thr Thr Val Glu Phe He Lys Glu His Leu Asn Glu 
565 570 575 

-85- 



10 



lie Thr Asp Phe Thr Thr Asp Thr Glu Asn Pro Lys Ser Thr Thr Glu 
580 585 590 

Ser Ser Met Ala lie Leu Ser Ala Val Gly Gly Asp Asp Asp Asp Asp 
595 600 605 

Asp Glu Asp Glu Glu Glu Asp Glu Gly Asp Glu Glu Glu Asp Glu Glu 
610 615 620 

Asp Glu Glu Glu Asp Asp Pro Val Trp Arg Phe Asp Asp Arg Asp Val 
625 630 635 640 

lie Leu Tyr Asn lie Ala Leu Gly Ala Thr Thr Lys Gin Leu Lys Tyr 
15 645 • 650 655 

Val Tyr Glu Asn Asp Ser Asp Phe Gin Val lie Pro Thr Phe Gly His 
660 665 670 

20 Leu lie Thr Phe Asn Ser Gly Lys Ser Gin Asn Ser Phe Ala Lys Leu 

675 680 685 

Leu Arg Asn Phe Asn Pro Met Leu Leu Leu His Gly Glu His Tyr Leu 

690 695 700 

^0 25 

Ijl Lys Val His Ser Trp Pro Pro Pro Thr Glu Gly Glu lie Lys Thr Thr 

1T| 705 710 715 - 720 

jfs Phe Glu Pro He Ala Thr Thr Pro Lys Gly Thr Asn Val Val He Val 

Til 30 7 2 5 7 3 0 7 3 5 

' His Gly Ser Lys Ser Val Asp Asn Lys Ser Gly Glu Leu He Tyr Ser 

•U 740 745 750 

j 35 Asn Glu Ala Thr Tyr Phe lie Arg Asn Cys Gin Ala Asp Asn Lys Val 

755 760 765 



40 



55 



Tyr Ala Asp Arg Pro Ala Phe Ala Thr Asn Gin Phe Leu Ala Pro Lys 

770 775 780 

Arg Ala Pro Asp Tyr Gin Val Asp Val Pro Val Ser Glu Asp Leu Ala 

785 790 795 800 



Ala Leu Tyr Arg Leu Ser Gly Asp Arg Asn Pro Leu His He Asp Pro 
45 805 810 815 

Asn Phe Ala Lys Gly Ala Lys Phe Pro Lys Pro He Leu His Gly Met 
820 825 830 

50 Cys Thr Tyr Gly Leu Ser Ala Lys Ala Leu He Asp Lys Phe Gly Met 

835 840 845 

Phe Asn Glu He Lys Ala Arg Phe Thr Gly He Val Phe Pro Gly Glu 
850 855 860 



Thr Leu Arg Val Leu Ala Trp Lys Glu Ser Asp Asp Thr He Val Phe 

-86- 



865 



870 



875 



880 



10 



Gin Thr His Val Val Asp Arg Gly Thr He Ala He Asn Asn Ala Ala 
885 890 895 

He Lys Leu Val Gly Asp Lys Ala Lys He 
900 905 

(2) INFORMATION FOR SEQ ID NO: 23: 



U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 73 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY: linear 



?r 5 



30 



20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

GGATCCATGT CTCCAGTTGA TTTTAAAGAT AAAGTTGTGA TCATTACCGG TGCCGGTGGT 60 

25 GGTTTGGGTA AATACTACTC CCTCGAATTT GCCAAGTTGG GCGCCAAAGT CGTCGTTAAC 120 

GACTTGGGTG GTGCCTTGAA CGGTCAAGGT GGAAACTCCA AGGCCGCCGA CGTTGTCGTT 180 

GACGAAATTG TCAAGAACGG TGGTGTTGCC GTTGCCGATT ACAACAACGT CTTGGACGGT 240 

GACAAGATTG TCGAAACCGC CGTCAAGAAC TTTGGTACTG TCCACGTTAT CATCAACAAT 3 00 

GCCGGTATCT TGAGAGATGC CTCCATGAAG AAGATGACTG AAAAAGACTA CAAATTGGTC 360 

35 ATTGACGTGC ACTTGAACGG TGCCTTTGCC GTCACCAAGG CTGCTTGGCC ATACTTCCAA 420 

AAGCAAAAAT ACGGTAGAAT TGTCAACACA TCCTCCCCAG CTGGTTTGTA CGGTAACTTT 480 

GGTCAAGCCA ACTACGCCTC CGCCAAGTCT GCTTTGTTGG GATTCGCTGA AACCTTGGCC 540 

40 

AAGGAAGGTG CCAAATACTU^ CATCAAGGCC AACGCCATTG CTCCGTTGGC CAGATCAAGA 600 

ATGACTGAAT CTATCTTGCC ACCTCCAATG TTGGAAAAAT TGGGCCCTGA AAAGGTTGCC 660 

45 CCATTGGTCT TGTATTTGTC GTCAGCTGAA AACGAATTGA CTGGTCAATT CTTTGAAGTT 720 

GCTGCTGGCT TTTACGCTCA GATCAGATGG GAAAGATCCG GTGGTGTCTT GTTCAAGCCA 780 

GATCAATCCT TCACCGCTGA GGTTGTTGCT AAGAGATTCT CTGAAATCCT TGATTATGAC 840 

50 

GACTCTAGGA AGCCAGAATA CTTGAAGAAC CAATACCCAT TCATGTTGAA CGACTACGCC 900 

ACTTTGACCA ACGAAGCTAG AAAGTTGCCA GCTAACGATG CTTCTGGTGC TCCAACTGTC 960 

55 TCCTTGAAGG ACAAGGTTGT TTTGATCACC GGTGCCGGTG CTGGTTTGGG TAAAGAATAC 1020 

-87- 



J 



3 



10 



20 



50 



55 



TTAGTCGGTG ACAAJ^^VA GTTGTAATGA AGGATCC 273 7 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 906 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Met Ser Pro Val Asp Phe Lys Asp Lys Val Val He He Thr Gly Ala 
15 10 15 



Gly Gly Gly Leu Gly Lys Tyr Tyr Ser Leu Glu Phe Ala Lys Leu Gly 
20 25 30 



Ala Lys Val Val Val Asn Asp Leu Gly Gly Ala Leu Asn Gly Gin Gly 

n 35 40 45 

'% 25 

\f\ Gly Asn Ser Lys Ala Ala Asp Val Val Val Asp Glu He Val Lys Asn 

% 50 55 60 

Gly Gly Val Ala Val Ala Asp Tyr Asn Asn Val Leu Asp Gly Asp Lys 
30 6 5 7 0 7 5 8 0 

He Val Glu Thr Ala Val Lys Asn Phe Gly Thr Val His Val He He 
85 90 95 



35 Asn Asn Ala Gly He Leu Arg Asp Ala Ser Met Lys Lys Met . Thr Glu 

III 100 105 110 

Q Lys Asp Tyr Lys Leu Val He Asp Val His Leu Asn Gly Ala Phe Ala 

Q 115 120 125 

40 

Val Thr Lys Ala Ala Trp Pro Tyr Phe Gin Lys Gin Lys Tyr Gly Arg 
130 135 140 

He Val Asn Thr Ser Ser Pro Ala Gly Leu Tyr Gly Asn Phe Gly Gin 
45 145 150 155 160 

Ala Asn Tyr Ala Ser Ala Lys Ser Ala Leu Leu Gly Phe Ala Glu Thr 
165 170 175 



Leu Ala Lys Glu Gly Ala Lys Tyr Asn He Lys Ala Asn Ala He Ala 

180 185 190 

Pro Leu Ala Arg Ser Arg Met Thr Glu Ser He Leu Pro Pro Pro Met 

195 200 205 

Leu Glu Lys Leu Gly Pro Glu Lys Val Ala Pro Leu Val Leu Tyr Leu 
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210 



215 



Ser Ser Ala Glu Asn Glu Leu Thr Gly Gin Phe Phe Glu Val Ala Ala 
225 230 235 240 

5 

Gly Phe Tyr Ala Gin He Arg Trp Glu Arg Ser Gly Gly Val Leu Phe 
245 250 255 

Lys Pro Asp Gin Ser Phe Thr Ala Glu Val Val Ala Lys Arg Phe Ser 
10 260 265 270 

Glu lie Leu Asp Tyr Asp Asp Ser Arg Lys Pro Glu Tyr Leu Lys Asn 
275 280 285 

15 Gin Tyr Pro Phe Met Leu Asn Asp Tyr Ala Thr Leu Thr Asn Glu Ala 

290 295 300 

Arg Lys Leu Pro Ala Asn Asp Ala Ser Gly Ala Pro Thr Val Ser Leu 
305 310 315 320 

20 

Lys Asp Lys Val Val Leu He Thr Gly Ala Gly Ala Gly Leu Gly Lys 
325 330 335 

Q Glu Tyr Ala Lys Trp Phe Ala Lys Tyr Gly Ala Lys Val Val Val Asn 

^0 25 340 345 350 

m Asp Phe Lys Asp Ala Thr Lys Thr Val Asp Glu He Lys Ala Ala Gly 

jil 355 360 ^ 365 

Til 

7] 30 Gly Glu Ala Trp Pro Asp Gin His Asp Val Ala Lys Asp Ser Glu Ala 

370 375 380 

^L, He lie Lys Asn Val He Asp Lys Tyr Gly Thr He Asp He Leu Val 

O 385 390 395 400 

P 35 

rJ Asn Asn Ala Gly He Leu Arg Asp Arg Ser Phe Ala Lys Met Ser Lys 

405 410 415 

13 

13 Gin Glu Trp Asp Ser Val Gin Gin Val His Leu He Gly Thr Phe Asn 
40 4 2 0 4 2 5 4 3 0 

Leu Ser Arg Leu Ala Trp Pro Tyr Phe Val Glu Lys Gin Phe Gly Arg 
435 440 445 

45 He He Asn He Thr Ser Thr Ser Gly He Tyr Gly Asn Phe Gly Gin 

450 455 460 

Ala Asn Tyr Ser Ser Ser Lys Ala Gly He Leu Gly Leu Ser Lys Thr 
465 470 475 480 

50 

Met Ala He Glu Gly Ala Lys Asn Asn He Lys Val Asn He Val Ala 
485 490 495 

Pro His Ala Glu Thr Ala Met Thr Leu Thr He Phe Arg Glu Gin Asp 
55 5 0 0 5 0 5 5 1 0 
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Lys Asn LeiJ 
515 




X His Ala Asp Gin Val Ala Pro 
520 




Leu Val Tyr Leu 
525 



Gly Thr Asp Asp Val Pro Val Thr Gly Glu Thr Ser Glu He Gly Gly 
530 535 540 

Gly Trp He Gly Asn Thr Arg Trp Gin Arg Ala Lys Gly Ala Val Ser 
545 550 555 560 

His Asp Glu His Thr Thr Val Glu Phe He Lys Glu His Leu Asn Glu 
565 570 575 

He Thr Asp Phe Thr Thr Asp Thr Glu Asn Pro Lys Ser Thr Thr Glu 
580 585 590 

Ser Ser Met Ala He Leu Ser Ala Val Gly Gly Asp Asp Asp Asp Asp 
595 600 605 

Asp Glu Asp Glu Glu Glu Asp Glu Gly Asp Glu Glu Glu Asp Glu Glu 
610 615 620 

Asp Glu Glu Glu Asp Asp Pro Val Trp Arg Phe Asp Asp Arg Asp Val 
625 630 635 640 

He Leu Tyr Asn He Ala Leu Gly Ala Thr Thr Lys Gin Leu Lys Tyr 
645 650 655 

Val Tyr Glu Asn Asp Ser Asp Phe Gin Val He Pro Thr Phe Gly His 
660 665 670 

Leu He Thr Phe Asn Ser Gly Lys Ser Gin Asn Ser Phe Ala Lys Leu 
675 680 685 

Leu Arg Asn Phe Asn Pro Met Leu Leu Leu His Gly Glu His Tyr Leu 
690 695 700 

Lys Val His Ser Trp Pro Pro Pro Thr Glu Gly Glu He Lys Thr Thr 
705 710 715 720 

Phe Glu Pro He Ala Thr Thr Pro Lys Gly Thr Asn Val Val He Val 
725 730 735 

His Gly Ser Lys Ser Val Asp Asn Lys Ser Gly Glu Leu He Tyr Ser 
740 745 750 

Asn Glu Ala Thr Tyr Phe He Arg Asn Cys Gin Ala Asp Asn Lys Val 



Arg Ala Pro Asp Tyr Gin Val Asp Val Pro Val Ser Glu Asp Leu Ala 
785 790 795 800 

Ala Leu Tyr Arg Leu Ser Gly Asp Arg Asn Pro Leu His He Asp Pro 



755 



760 



765 



Tyr Ala Asp Arg Pro Ala Phe Ala Thr Asn Gin Phe Leu Ala Pro Lys 
770 775 780 



805 



810 



815 
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y i 



0? 



10 



55 



Asn Phe Ala Lys Gly Ala Lys Phe Pro Lys Pro He Leu His Gly Met 
820 825 830 

Cys Thr Tyr Gly Leu Ser Ala Lys Ala Leu He Asp Lys Phe Gly Met 
835 840 845 

Phe Asn Glu He Lys Ala Arg Phe Thr Gly He Val Phe Pro Gly Glu 
850 855 860 

Thr Leu Arg Val Leu Ala Trp Lys Glu Ser Asp Asp Thr He Val Phe 
865 870 875 880 



Gin Thr His Val Val Asp Arg Gly Thr He Ala He Asn Asn Ala Ala 
15 885 890 895 

He Lys Leu Val Gly Asp Lys Ser Lys Leu 
900 905 

20 (2) INFORMATION FOR SEQ ID NO : 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2737 base pairs 
□ (B) TYPE: nucleic acid 

=13 25 (C) STRANDEDNESS : single 

iJI (D) TOPOLOGY: linear 



30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 25: 

GGATCCATGT CTCCAGTTGA TTTTAAAGAT AAAGTTGTGA TCATTACCGG TGCCGGTGGT 60 

35 

GGTTTGGGTA AATACTACTC CCTCGAATTT GCCAAGTTGG GCGCCAAAGT CGTCGTTAAC " 12 0 

GACTTGGGTG GTGCCTTGAA CGGTCAAGGT GGAAACTCCA AGGCCGCCGA CGTTGTCGTT 180 

40 GACGAAATTG TCAAGAACGG TGGTGTTGCC GTTGCCGATT ACAACAACGT CTTGGACGGT 24 0 

GACAAGATTG TCGAAACCGC CGTCAAGAAC TTTGGTACTG TCCACGTTAT CATCAACAAT 300 

GCCGGTATCT TGAGAGATGC CTCCATGAAG AAGATGACTG AAAAAGACTA CAAATTGGTC 36 0 

45 

ATTGACGTGC ACTTGAACGG TGCCTTTGCC GTCACCAAGG CTGCTTGGCC ATACTTCCAA 420 

AAGCAAAAAT ACGGTAGAAT TGTCAACACA TCCTCCCCAG CTGGTTTGTA CGGTAACTTT 4 80 

50 GGTCAAGCCA ACTACGCCTC CGCCAAGTCT GCTTTGTTGG GATTCGCTGA AACCTTGGCC 540 

AAGGAAGGTG CCAAATACAA CATCAAGGCC AACGCCATTG CTCCGTTGGC CAGATCAAGA 600 

ATGACTGAAT CTATCTTGCC ACCTCCAATG TTGGAAAAAT TGGGCCCTGA AAAGGTTGCC 660 



CCATTGGTCT TGTATTTGTC GTCAGCTGAA AACGAATTGA CTGGTCAATT CTTTGAAGTT 72 0 

-92- 



10 



15 



20 



o 



7 1 30 



ijl 35 



40 



45 



50 



55 



GCTGCTGGCT 


TTTACGCTCA 


GATCAGATGG 


GAAAGATCCG 


GTGGTGTCTT 


GTTCAAGCCA 


780 


GATCAATCCT 


TCACCGCTGA 


GGTTGTTGCT 


AAGAGATTCT 


CTGAAATCCT 


TGATTATGAC 


840 


GACTCTAGGA 


AGCCAGAATA 


CTTGAAGAAC 


CAATACCCAT 


TCATGTTGAA 


CGACTACGCC 


900 


ACTTTGACCA 


ACGAAGCTAG 


AAAGTTGCCA 


GCTAACGATG 


CTTCTGGTGC 


TCCAACTGTC 


960 


TCCTTGAAGG 


ACAAGGTTGT 


TTTGATCACC 


GGTGCCGGTG 


CTGGTTTGGG 


TAAAGAATAC 


1020 


GCCAAGTGGT 


TCGCCAAGTA 


CGGTGCCAAG 


GTTGTTGTTA 


ACGACTTCAA 


GGATGCTACC 


1080 


AAGACCGTTG 


ACGAAATCAA 


AGCCGCTGGT 


GGTGAAGCTT 


GGCCAGATCA 


ACACGATGTT 


1140 


GCCAAGGACT 


CCGAAGCTAT 


CATCAAGAAT 


GTCATTGACA 


AGTACGGTAC 


CATTGATATC 


1200 


TTGGTCAACA 


ACGCCGGTAT 


CTTGAGAGAC 


AGATCCTTTG 


CCAAGATGTC 


CAAGCAAGAA 


1260 


TGGGACTCTG 


TCCAACAAGT 


CCACTTGATT 


GGTACTTTCA 


ACTTGAGCAG 


ATTGGCATGG 


1320 


CCATACTTTG 


TTGAAAAACA 


ATTTGGTAGA 


ATCATCAACA 


TTACCTCCAC 


CAGTGGTATC 


1380 


TACGGTAACT 


TTGGTCAAGC 


CAACTACTCG 


TCTTCTAAGG 


CTGGTATCTT 


GGGTTTGTCC 


1440 


AAGACCATGG 


CCATTGAAGG 


TGCTAAGAAT 


AACATTAAGG 


TCAACATTGT 


TGCTCCACAC 


1500 


GCTGAAACTG 


CCATGACCTT 


GACCATCTTC 


AGAGAACAAG 


ACAAGAACTT 


GTACCACGCT 


1560 


GACCAAGTTG 


CTCCATTGTT 


GGTCTACTTG 


GGTACTGACG 


ATGTCCCAGT 


CACCGGTGAA 


1620 


ACTTCCGAAA 


TCGGTGGTGG 


TTGGATCGGT 


AACACCAGAT 


GGCAAAGAGC 


CAAGGGTGCT 


1680 


GTCTCCCACG 


ACGAACACAC 


CACTGTTGAA 


TTCATCAAGG 


AGCACTTGAA 


CGAAATCACT 


1740 


GACTTCACCA 


CTGACACTGA 


AAATCCAAAA 


TCTACCACCG 


AATCCTCCAT 


GGCTATCTTG 


1800 


TCTGCCGTTG 


GTGGTGATGA 


CGATGATGAT 


GACGAAGACG 


AAGAAGAAGA 


CGAAGGTGAT 


1860 


GAAGAAGAAG 


ACGAAGAAGA 


CGAAGAAGAA 


GACGATCCAG 


TCTGGAGATT 


CGACGACAGA 


1920 


GATGTTATCT 


TGTACAACAT 


TGCCCTTGGT 


GCCACCACCA 


AGCAATTGAA 


GTACGTCTAC 


1980 


GAAAACGACT 


CTGACTTCCA 


AGTCATTCCA 


ACCTTTGGTC 


ACTTGATCAC 


CTTCAACTCT 


2040 


GGTAAGTCAC 


AAAACTCCTT 


TGCCAAGTTG 


TTGCGTAACT 


TCAACCCAAT 


GTTGTTGTTG 


2100 


CACGGTGAAC 


ACTACTTGAA 


GGTGCACAGC 


TGGCCACCAC 


CAACCGAAGG 


TGAAATCAAG 


2160 


ACCACTTTCG 


AACCAATTGC 


CACTACTCCA 


AAGGGTACCA 


ACGTTGTTAT 


TGTTCACGGT 


2220 


TCCAAATCTG 


TTGACAACAA 


GTCTGGTGAA 


TTGATTTACT 


CCAACGAAGC 


CACTTACTTC 


2280 


ATCAGAAACT 


GTCAAGCCGA 


CAACAAGGTC 


TACGCTGACC 


GTCCAGCATT 


CGCCACCAAC 


2340 


CAATTCTTGG 


CACCAAAGAG 


AGCCCCAGAC 


TACCAAGTTG 


ACGTTCCAGT 


CAGTGAAGAC 


2400 
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^cOTttt gtctggtgac agaaacccat tgcacattgj 



TTGGCTGCTT TGTACfTOTTT GTCTGGTGAC AGAAACCCAT TGCACATTGA TCCAAACTTT 2460 
GCTAAAGGTG CCAAGTTCCC TAAGCCAATC TTACACGGTA TGTGCACTTA TGGTTTGAGT 2 52 0 

5 

GCTAAGGCTT TGATTGACAA GTTTGGTATG TTCAACGAAA TCAAGGCCAG ATTCACCGGT 2580 
ATTGTCTTCC CAGGTGAAAC CTTGAGAGTC TTGGCATGGA AGGAAAGCGA TGACACTATT 2640 
10 GTCTTCCAAA CTCATGTTGT TGATAGAGGT ACTATTGCCA TTAACAACGC TGCTATTAAG 2700 
TTAGTCGGTG ACAAATGAAA GATCGAATGA AGGATCC 2 737 

(2) INFORMATION FOR SEQ ID NO : 26: 

15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 903 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

20 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

yi 

^ Met Ser Pro Val Asp Phe Lys Asp Lys Val Val He He Thr Gly Ala 

j'U 1 5 10 15 

Ijl 30 

yj Gly Gly Gly Leu Gly Lys Tyr Tyr Ser Leu Glu Phe Ala Lys Leu Gly 

=P 20 25 30 

Q Ala Lys Val Val Val Asn Asp Leu Gly Gly Ala Leu Asn Gly Gin Gly 

m 35 3 5 4 0 4 5 

Gly Asn Ser Lys Ala Ala Asp Val Val Val Asp Glu He Val Lys Asn 
£^ 50 55 60 

40 Gly Gly Val Ala Val Ala Asp Tyr Asn Asn Val Leu Asp Gly Asp Lys 

65 70 75 80 

He Val Glu Thr Ala Val Lys Asn Phe Gly Thr Val His Val He He 
85 90 95 

45 

Asn Asn Ala Gly He Leu Arg Asp Ala Ser Met Lys Lys Met Thr Glu 
100 105 110 

Lys Asp Tyr Lys Leu Val He Asp Val His Leu Asn Gly Ala Phe Ala 
50 115 1 20 1 25 

Val Thr Lys Ala Ala Trp Pro Tyr Phe Gin Lys Gin Lys Tyr Gly Arg 
130 135 140 

55 He Val Asn Thr Ser Ser Pro Ala Gly Leu Tyr Gly Asn Phe Gly Gin 

145 150 155 160 
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5 



S . 3 



10 



40 



50 



55 



Ala Asn Tyr Ala Ser Ala Lys Ser Ala Leu Leu Gly Phe Ala Glu Thr 
165 170 175 

Leu Ala Lys Glu Gly Ala Lys Tyr Asn lie Lys Ala Asn Ala lie Ala 
180 185 190 

Pro Leu Ala Arg Ser Arg Met Thr Glu Ser He Leu Pro Pro Pro Met 
195 200 205 

Leu Glu Lys Leu Gly Pro Glu Lys Val Ala Pro Leu Val Leu Tyr Leu 
210 215 220 

Ser Ser Ala Glu Asn Glu Leu Thr Gly Gin Phe Phe Glu Val Ala Ala 
15 225 230 235 240 

Gly Phe Tyr Ala Gin He Arg Trp Glu Arg Ser Gly Gly Val Leu Phe 
245 250 255 

20 • Lys Pro Asp Gin Ser Phe Thr Ala Glu Val Val Ala Lys Arg Phe Ser 

260 265 270 

.^^ Glu lie Leu Asp Tyr Asp Asp Ser Arg Lys Pro Glu Tyr Leu Lys Asn 

275 280 285 

'^l4 25 

Ul Gin Tyr Pro Phe Met Leu Asn Asp Tyr Ala Thr Leu Thr Asn Glu Ala 

IB 290 295 300 



Arg Lys Leu Pro Ala Asn Asp Ala Ser Gly Ala Pro Thr Val Ser Leu 
30 305 310 315 320 



. Lys Asp Lys Val Val Leu He Thr Gly Ala Gly Ala Gly Leu Gly Lys 

n 325 330 335 

Jij 35 Glu Tyr Ala Lys Trp Phe Ala Lys Tyr Gly Ala Lys Val Val Val Asn 

ll 340 345 350 



Asp Phe Lys Asp Ala Thr Lys Thr Val Asp Glu lie Lys Ala Ala Gly 
355 360 365 

Gly Glu Ala Trp Pro Asp Gin His Asp Val Ala Lys Asp Ser Glu Ala 
370 375 380 

He He Lys Asn Val He Asp Lys Tyr Gly Thr He Asp He Leu Val 
45 385 390 395 400 

Asn Asn Ala Gly He Leu Arg Asp Arg Ser Phe Ala Lys Met Ser Lys 
405 410 415 



Gin Glu Trp Asp Ser Val Gin Gin Val His Leu He Gly Thr Phe Asn 
420 425 430 

Leu Ser Arg Leu Ala Trp Pro Tyr Phe Val Glu Lys Gin Phe Gly Arg 
435 440 445 

He He Asn He Thr Ser Thr Ser Gly He Tyr Gly Asn Phe Gly Gin 
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35 



450 



455 



Ala Asn Tyr Ser Ser Ser Lys Ala Gly lie Leu Gly Leu Ser Lys Thr 
465 470 475 480 

5 

Met Ala lie Glu Gly Ala Lys Asn Asn He Lys Val Asn He Val Ala 
485 490 495 

Pro His Ala Glu Thr Ala Met Thr Leu Thr He Phe Arg Glu Gin Asp 
10 500 505 510 

Lys Asn Leu Tyr His Ala Asp Gin Val Ala Pro Leu Leu Val Tyr Leu 
515 520 525 

15 Gly Thr Asp Asp Val Pro Val Thr Gly Glu Thr Ser Glu He Gly Gly 

530 535 540 

Gly Trp He Gly Asn Thr Arg Trp Gin Arg Ala Lys Gly Ala Val Ser 
545 550 555 560 

20 

His Asp Glu His Thr Thr Val Glu Phe He Lys Glu His Leu Asn Glu 
565 570 575 

13 He Thr Asp Phe Thr Thr Asp Thr Glu Asn Pro Lys Ser Thr Thr Glu 

=0 25 580 585 590 

m Ser Ser Met Ala He Leu Ser Ala Val Gly Gly Asp Asp Asp Asp Asp 

jVj 595 600 605 

30 Asp Glu Asp Glu Glu Glu Asp Glu Gly Asp Glu Glu Glu Asp Glu Glu 

'^'^ 610 615 620 

Asp Glu Glu Glu Asp Asp Pro Val Trp Arg Phe Asp Asp Arg Asp Val 
625 630 635 640 

He Leu Tyr Asn He Ala Leu Gly Ala Thr Thr Lys Gin Leu Lys Tyr 
645 650 655 



Val Tyr Glu Asn Asp Ser Asp Phe Gin Val He Pro Thr Phe Gly His 
40 660 665 670 

Leu He Thr Phe Asn Ser Gly Lys Ser Gin Asn Ser Phe Ala Lys Leu 
675 680 685 

45 Leu Arg Asn Phe Asn Pro Met Leu Leu Leu His Gly Glu His Tyr Leu 

690 695 700 

Lys Val His Ser Trp Pro Pro Pro Thr Glu Gly Glu He Lys Thr Thr 
705 710 715 720 

50 

Phe Glu Pro He Ala Thr Thr Pro Lys Gly Thr Asn Val Val He Val 
725 730 735 

His Gly Ser Lys Ser Val Asp Asn Lys Ser Gly Glu Leu He Tyr Ser 
55 7 4 0 7 4 5 7 5 0 
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Asn Glu AlH 

755 




ir Tyr Phe lie Arg Asn Cys Gin 
760 




Asp Asn Lys Val 
765 



Tyr Ala. Asp Arg Pro Ala Phe Ala Thr Asn Gin Phe Leu Ala Pro Lys 
770 775 780 

Arg Ala Pro Asp Tyr Gin Val Asp Val Pro Val Ser Glu Asp Leu Ala 
785 790 795 800 

Ala Leu Tyr Arg Leu Ser Gly Asp Arg Asn Pro Leu His He Asp Pro 
805 810 815 

Asn Phe Ala Lys Gly Ala Lys Phe Pro Lys Pro He Leu His Gly Met 
820 825 830 

Cys Thr Tyr Gly Leu Ser Ala Lys Ala Leu He Asp Lys Phe Gly Met 
835 840 845 

Phe Asn Glu He Lys Ala Arg Phe Thr Gly He Val Phe Pro Gly Glu 
850 855 860 

Thr Leu Arg Val Leu Ala Trp Lys Glu Ser Asp Asp Thr He Val Phe 
865 870 875 880 

Gin Thr His Val Val Asp Arg Gly Thr He Ala He Asn Asn Ala Ala 
885 890 895 

He Lys Leu Val Gly Asp Lys 



900 
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